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Abstract. We consider several coding discretizations of continuous functions 
which reflect their variation at some given precision. We study certain statis- 
tical and combinatorial properties of the sequence of finite words obtained by 
coding a typical continuous function when the diameter of the discretization 
tends to zero. Our main result is that any finite word appears on a subsequence 
discretization with any desired limit frequency. 



1. Introduction 

Take a straight line in the plane and code it by a — 1 sequence as follows: 
each time it crosses an integer vertical line (that is, x = n for some n G Z) write 
a and each time it crosses an integer horizontal line (y = n for some n G Z) 
write a 1. In the case of irrational slope the corresponding sequence is called a 
Sturmian sequence [F]. A classical result tells us that each word that appears in 
such a sequence has a limiting frequency. Moreover, the set of numbers occurring as 
limit frequencies can be completely described [B]. Recently similar condings have 
been considered for quadratic functions and limiting frequencies are caluculatcd for 
words which appear [DTZ]. 

In this article we ask the question if limiting frequencies can appear in more gen- 
eral circumstance: namely for typical, in the sense of Baire, continuous functions. 
For such functions it is not clear which kind of coding should be used. Here we 
propose three different notions of coding. For each of these codings we study two 
different questions: if all finite words can appear in a code or not, and if words in 
the code of a typical function can have a limiting frequency. 

A discretization system of [0, 1] is a sequence X n := {0 = x", x%, x^ = 1} C 
[0, 1] where, 

(1) I 1 Cl 2 C...Cl n C....C[0,l], 

(2) For each X n , xf < xf +1 for all 1 < i < N n , 

(3) The maximal resolution H n := maxi<t<jv„ \xf +1 — xf\ converges to zero. 

We denote by h n := mini<j<jv„ \%Z\-i ~ x ?\> the minimal resolution. To each dis- 
cretization system X n , we associate the (uniform) discretization of the image space 
given by Y n := {y™ = jh n : j G N}. 

Let / G C([0, 1]). For each x™ G X n there is a unique j G Z such that f(x") G 

Lct us dcnotc this i b y /"• 

Definition 1. (See Figure 1) The quantitative code Q(f,n) G 1, Nn ~ 1 of / G 
C([0,1]) is defined by: 

Q(f, n)i := /r +2 -f? +1 , < * < N n - 1. 
i 
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Figure 1. The various codes considered: 
quantitative: Q(f,n) = 5~2~3022~3 
qualitative: q(f,n) = 1011~1 

stretched: s(/,n) = 111110^1" 10" 1" 1" 1001 101 10^1" 1" 10 



The qualitative version n) G {—1,0, 1}^™ 1 of the quantitative code Q(f,n) 
is defined by setting 

f 1, i£Q(f,n)i>0 
q(f,n)i:=l 0, ifQ(/,n)< = 
[-1, ifQ(/,n)i<0. 
Finally, the stretched version s(/, n) 6 {1,0,-1}* of the quantitative code 
Q(f,n) is defined as follows: if Q(f,n)i is positive then we replace it by a run 
of Q(f, n)i l's followed by a zero and by a run of — l's followed by a zero if Q{f, n)j 
is negative. 

All three of these codes seem natural in terms of discrete curves on the computer 
screen. In case when the discretization system is uniform, the streched quantitative 
code of a line segment with irrational slope is exactly the well known coding by 
Sturmian sequences [F]. 

Let us introduce some more notation in order to state our main results. Let w, v 
be finite words over the same alphabet S (finite or infintie) such that < We 
denote by 

oc(w,v) := #{j : =w,0<j<\v\- \w\} (1) 
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the number of times w occurs in v and by fr(w, v) := ° c fe 1 ^ the relative frequency 
of w in v. The minimal periodic factor length p(w) of w is defined to be p(w) := 
min{|u| : oc(w,wu) = 2}. For example, p(010) = 2. 

Remark 1. The limit relative frequency of w; in some infinite sequence u™ is at most 
-K. That is: 

p(w) 

lim sup fr(w,v n ) < —. — 7. 

n P{W) 

Our main result is the following: 

Theorem 1. Let X n be a discretization system. For a typical f £ C([0, 1]) the 
following holds: 

(i) (Qualitative) For any w £ { — 1,0, 1}* and a £ 

[0,jfa], there exists a subse- 
quence Ui such that 

lim friw,q(f,m)) = a. 

% — >oo 

(ii) (Quantitative) Suppose that X n satisfies lim inf „ n/i„ = 0. Then for any w £ 
Z* and a £ [0, ^jy], there exists a subsequence m such that 

lim fr(w,Q(f,m)) = a. 

I — >oc 

(iii) (Stretched) Suppose that X n satisfies liminf„nft,„ = and is bounded. 
Then 

liminf/r(0, S (/, n)) = 0, 

i//(l)>/(0) 

liminf fr(l,s(f,n)) = limsup/r(-l, s(/,ri)) = ^ 
and if /(l) < /(0) 

limsup/r(l, s(f,n)) = liminf s(/, n)) = ]- 

n — >oo n >oc £ 

2. Preliminaries 

Wc start by a simple result, which says that one can focus on functions which 
do not intersect the discretization. 

Lemma 1. Let X n be a discretization system. Then for a typical function f one 
has that for alln CN and alii = 1, N n , f(xf) £ (y™, for the corresponding 

j£N. 

Proof. The set F n = {/ : f(xf) ^ yj for all j € N and i = 1, ...,N n } is clearly 
open and dense. Hence, f] n F n is a G^-dense set. □ 

One would expect that codings of typical functions contains few zeros and all 
possible words of l's and — Vs. This is partially true. 

Proposition 1. Let X n be a discretization system. For a typical f , q(f,n) contains 
no 0, infinitely often. 
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Proof. We prove that the set of functions such that for all n £ N, there exists m > n 
such that q(f, m)i ^ for all i = 0, N m — 2, is residual in C([0, 1]). Observe that 
?(/) n )i 7^ whenever |/(a:™ +1 ) — /(a;™) > Clearly, the set 

i 7 ™ = {/ : \f{*? + 1) - /(af )| > ^ for a11 * = ■■- ^« - 1} 
is open. Moreover, for each n £ N, the set 

(J i?m 

m>n 

is a dense open set. Indeed, given g £ C([0, 1]) and e > 0, there exists m > n such 
that h m < e and it is easy to construct a function / £ F m such that \\g — /||oo < £■ 
Therefore, 

n u pm 

n m>n 

is a Ga-dense set. 

□ 

Remark 2. In the previous result, the symbol cannot be replaced by 1 nor by 
— 1. On the other hand, Theorem 1 says that the qualitative and the quantitative 
codings of a typical function does not posses any statistical regularity. So that from 
a statistical viewpoint, the symbols 1 or —1 (or any n £ Z in the quantitative case) 
are not privileged with respect to 0. 

2.1. Approximation by e-boxes. Here we will describe a simple construction 
which will be used in the proofs of our main results. 

Let 5 > 0. For a given n we define a subdiscretization := {x ik : k = 1, K} 
of X n as follows: 

Xj il 0, 

x lk+1 = maxja:" £ X n : xf < x lk + 26} 

x i K = 1 

The number of points of X n in the interval (xi k , Xi k+1 ] will be denoted by Ij.. With 
this notation we have Xi k+1 — Xi k +i k . 

Next, to each g £ C([0, 1]) and e > 0, the associated e-boxes Bk(g,£,S) are 
defined by: 

B k (g, s, S) := [x lk , x lk + 25) x (g(A k ) - |, g(A k ) + |)) (2) 

where A fc = ' k 2 ' fc+1 . See fi gure 2.1. We shall write just B k when no confusion is 
possible. 

Let S g : R + — > M. + denote the modulus of continuity of g. That is, for every x, x' 
in [0, 1], if \x — x'\< 8g{e) then \ f(x) - f(x')\ < e. 

Lemma 2. For the e-boxes Bk(g,£,S), k = 1, ■■■ 1 K, the following holds: 

(i) If S < 5 s (f ) then the e-boxes form an e-cover of the graph of g. That is, any 
(x,y) 6 \J k B k {g,e,8 g {D) satisfy \g{x) -y\<e. 

(ii) If \±]H n < 26, then K=\&]+1. 

Proof. Let (x, y) £ \J k B k . Then (x, y) £ -Bfc for some k. Hence \x— A k \ < 6 < 5 g (§) 
which implies — g(A k )\ < |. Since g(Ak) — | < y < <?(Afc) + | we conclude 
|g(x) -y| < e. 
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Figure 2. An e-cover by the boxes Bk(g, e, 5) 



Since Xi k+1 < Xi k + 26, we have that K > 
aijj. + 25 — Xi fc+1 < H n . It follows that 



if < 



" 1 " 




'\25\Hn 


25 


+ 


25 



Hence, if 25 > \^\H n we obtain K < f*^] + 1 



1. Now, for each fc we have 



□ 



2.2. Words and frequencies. Consider a finite word w over some alphabet E. 
For each a £ [0, ^jj] and t > 0, it is easy to construct a sequence of finite words 
Vk, k = 1, ...,iv — 1, satisfying Ufc = i& — 1 and \fr(w,Vk) — a\ < gj. Let fefc € S 
be any sequence of if — 1 letters and put 1; = wi6iW2^2 • • ■ Vr—i^K—i- 

Lemma 3. If ^-r^r^ < fieri |/r(iy, v) — a\ < j. 

Proof. Put oc(w,Vk) = Pk, then we have 

T,k=iPk / , ^ „ EfJi 1 ^- , (if-i)H 



< fr(w,f,ri) < 



A simple calculation yields 



v-^if-l 

Efc=i Pk 



Et=i L (h ~ 1) {Yk=x{h i)) 2 + (if - 1) ££LT(fc - 1 



if-i, 



(3) 



(4) 



(i 
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On the one hand we have: 



i K - x 
— Y 

fc=i 



fr(w,v k ) - a 



< 



3t 



and on the other hand, the absolute value of the second term in the right side of 
equation (4) is less than 

\v\ - (K - 1) 



1 1 

(M - (K 1))* + (|«| - (K 1))(K-1) - H - 3t 



so that 



Z^fc=i Pfe 



< 



32 



Since 



(if-i)M 



lemma is proved 



< ir, from equation (3) we obtain \fr(w, f,n) — a\ < \ and the 



□ 



3. Proofs 

Proof of Theorem 1. We begin by proving parts (i) and (ii). For each finite word 
w in {—1,0, 1}* or Z*, let {a™} sS N be a sequence which is dense on [0, ^j]- Let 
Fi denote the open sets defined in Lemma 1 (the set of functions which do not 
intersect the discretization X{). For integers s,n,t, consider the sets 



L 



w,s,n,t '■= U e ^i<nFi : \fr(w,q(f,n)) - a s \ < -}, 

Fw,s,n,t ■= If e HiKnFi : \fr (w, q(f, n)) - a s \ < -}. 

Clearly these sets are open since a function / in C\i< n Fi can be perturbed without 
changing its code q(f,n) or Q(f,n). Hence, the following sets are open too. 



K,.,m,t ■= if- " '"•/ I" 



-1 1 

w.s.n.tS i 



F w, s ,m,t ■= {/ : 3n >mje F wsnt }. 
We now show that these sets are moreover dense. Let g G C([0, 1]) and e > 0. We 
will construct a function / in F^ smt (respectively F® a mt ) such that ||/— g\\oo < £■ 

Case If*}, amt . Put 6 < min{J ff (|), f } and let B k be the associated e-boxes. For 
n > m large enough (in particular such that \i*~\H n < 26) there exists a sequence 
of finite words Vk, k = 1, ...,K — 1, such that \vk\ = h — 1, \fi"(w,Vk) — ot 8 \ < ^ 



and 



(K-l)\ 



< 



N n ^ 3f 

We claim that a function f v can be constructed such that for each fc we have 
q(f v , = Vk and /„ is £-close to g (the interval (xi k , £i fc +i) is reserved to make 

"the bridge" and there are K such intervals, see figure 3). To see this, observe that 
the condition 26 < | implies that for each k 7 the triangles of vertices (cik,bk,Ck) 
defined by 



(x ik+1 , g(A k )) 
(a* fc+1 , 5(A fc ) + \v k \h Tl 
(x ik+1 ,g(A k ) - \v k \hr, 
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Figure 3. The triangles (a k ,b k ,c k ) 



are included in Bfc and that for any v k , a function /„ such that q(f v , rif*~^_* = v k 
can be inscribed in these triangles. By lemma 2 a function / so constructed satisfies 
11/ — ff||oo < e - By lemma 3 we have that \fr(w, q(f, n)) — a s \ < j. 

Case t . The proof that these sets are dense is the same as for the sets 



Fw s m t> w ith the only exception that we have to take S < 



iH(w] 



where H{w) 



maxi \ wi\ denotes the hight of w. This condition assures that a function f v such 
that q{f v ^n)\ k ^ = Vk can be inscribed in the corresponding triangles: 



It follows that the sets 

n 



(xi„+i, g(A k )) 

(x ik+1 ,g(A k ) + \v k \H(w)h n ) 
(x ik+1 ,g(A k ) - \v k \H(w)h n ). 



and 



n 



F 



are both G^-dense. 

Finally we prove part (iii). Let u n '■= oc(l, s(f,n)) be the numbers of l's (or 
"ups") in s(f,n) and d„ := oc(—l,s(f,n)) be the number of —l's (or "downs"), in 
stage n. Then V n = u„ + d„ denotes the total n-variation. By definition we have 
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that \s(f, n)\ =V n + N n , where N n is both the cardinality of the discretization and 
the number of zeros. Hence we have 

We will need the following lemma: 

Lemma 4. Let X„ be a discretization system satisfying lim inf n nh n = 0. Then, 
for a typical f, there are infinitely many n such that u n > and d n > ^^ L . 

Proof. Consider the set of functions 

F n := if : card{i : Q(f,n) l > n} > and card{i : Q(f,n) l < -n} > "y L } n fl F i 
This is an open set. Moreover, for any m £ N, the set 

n>m 

is dense. For let g £ C[0, 1] and consider the associated e-boxes Bk- It is clear 
that for some n > m such that nh n < | one can construct a function / satisfying 
graph(f) C Ufc-Bfc and — f{xi)\ > n f° r a U *■ Moreover, we can alternate 

the sign of |/(xj+i) — f{x{)\ at every i, with at most K exceptions. Hence the 
function so constructed belongs to F n and then the set 

is Gs dense. 

□ 

Now, a simple calculation yields 

u n 1 



V n 2 - ^ 

where A„ = L (/(1) fe ~ /(0)) J ■ So, if /(l) = /(0) we have ^ = ±. Let M be a bound 
for f^-. We have then that < MN n and hence A„ < (/(l) - f(0))MN n . By 
Lemma 4 we have that 

(/(l) - f(0))MN n < 3(/(l) - f(0))MN n 

and 

< 3 
K n 

for infinitely many n, so that, 

liminf^=0 if /(l)>/(0), (5) 

Un 

limsup^=0 if /(l)</(0). (6) 

U n 

Hence, when /(l) > /(0) we have 
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and when f(l) < /(0) we have 

,. u n u n 1 1 
lim sup — — — = lmi sup — = -r— = — 

n^oo V n +N n rwoo V n 2 — lim inf — 2 

and the results follows by symmetry. 

□ 
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